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The Information Matrix in Latent- Variable Models 

Abstract 

The information matrix for the parameters in a latent - 
variable model is bounded from above by the information that would 
obtain if the values of the latent variables could also be 
observed. The difference is the "missing information." This 
paper discusses the structure of the information matrix, and 
characterizes the degree to which missing information can be 
recovered by exploiting collateral variables for respondents. The 
results are illustrated with data from the Armed Services 
Vocational Aptitude Battery. 



KEY WORDS: collateral information, item response theory, latent 
variables, missing information principle. 
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1 . Introduction 
Latent variable models are used in the social sciences to 
provide parsimonious descriptions of the associations among 
observable variables in terms of theoretically derived constructs. 

Let x - (x x )' denote observable variables, 0 denote latent 

variables, and f}^ denote parameters of the regressions of the x^s 
on 0 through the known functions f ^ (x^ | 8 . Under the 
assumption of conditional independence, 

f<x|«;0 ) - n f (x Ae.p^. 
j 

If g(d\0^) is the density function of 8 in a population of 
interest, then the density of the observed variables is given by 
the mixture 

h(x|/J) - J f(x|« f ^ 1 ) g(8\0 2 ) dS , 

where - ^i f ^2^' F ° r notationa - 1 - convenience, we shall suppress 
dependence on 0^ and ^ in f, g, h, and elsewhere. 

Some examples of latent-variable models follow: 

o An item response theory (IRT) model for an n-item educational 
test gives the probability that an examinee will respond 
correctly to item j--Xj-l rather than 0--as a function of (i) 
an unobservable scalar 8 characterizing the proficiency of 
the examinee and (ii) a possibly vector -valued parameter 



characterizing the regression of x^ on 8 (Lord 1980). In 
this case, there is a distinct subparameter tf„ . for each 
item, so that ^-W u ^ n ). 

o In factor analysis models (Thurstone 1948), the "factor 

loadings" 0^ of the observed variable x^ are coefficients of 
its linear regression on unobservable trait values 8. It is 
often assumed that g is a standardized multivariate normal 
density . 



o In latent class models (Lazarsfeld 1950), 0 implies response 
probabilities for items from members of classes 1 through K, 
but respondents' class memberships 9 are not observed. Their 
distribution g is multinomial, with parameters f}^ • 

This paper concerns the structure of information matrices 
that arise in the estimation of fi. If values of latent variables 
are construed as missing data (as in Dempster, Laird, and Rubin 
1977) 9 t he expected information matrix associated with estimating 

from values of x is bounded from above by the expected 
information that would obtain if values of 8 were observed as well 
(Orchard and Woodbury 1972). The difference is "missing 
information." We aim to characterize this loss, and to 
demonstrate how covariates, or collateral variables y for 
respondents, can be used to recover some of the missing 
information for /J The following section gives background and 
notation for latent variables models. Subsequent sections discuss 
expected information in maximum likelihood (ML) estimation of £ 
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from x, then in estimation from x and y. Results are then 
presented for observed information matrices. Finally, a numerical 
illustration with data from the Profile of American Youth (U.S. 
Department of Defense 1982) is given. 

2. Background and Notation 
In the terminology of Dempster, Laird, and Rubin (1977), 
estimating ft from a sample of N independent observations of (0,x) 
is a "complece-data" problem. The loglikelihood for @ is 

N 

A QX - S In [f(x |* 0 ) g(0.|/? 2 )] . (2.1) 
i-1 

Assume that both f and g are twice-differentiable , and define the 
gradient vector s(0,x) for f$ for the loglikelihood of a single 
observation by 

s(0,x) - [s 1 (^x),s 2 (^,x)] 

- [|^ In f(x|^,^ 1 ), |^ In g(*|/3 2 )]. (2.2) 

A 

Under regularity conditions, the MLE £ solves the likelihood 
equation 

N 

0 - S s(0. ,x.) 
i-1 1 1 

and for large N is approximately multivariate normal in repeated 
samples, with mean ft and covariance given by the inverse of the 



expected information matrix N I~ v . Using the fact that E (s)«0, 

yx *~ xy 

we further our purpose by writing I in the following manner: 

yx 



I QX - JTs(0,x) s'(tf.x) p(tf Jx.jS) d0 h(x|/3) dx 
[where p(0|x,/3) is obtained by Bayes theorem as f (x| 9 )g(0)/h(x) ] 



- E x [E Q (ss' |x)] 



- Var xe (s) 



Let lL_ - Var (sS denote the block of the information matrix 
©X X0 

that pertains to , and note that (2.2) implies that the off- 
diagonal block of I QX for elements of fi^ and those of f)^ is zero. 

Suppose that collateral variables y such as educational or 
demographic status could also be observed for respondents, and let 
p(y|7) denote their density in a population of interest. In an 
extension of conditional independence, it is desirable to posit 
that 

£(x\O t 0 v y) - fCxj*.^) ; (2.3) 

that is, 9 also explains the associations among observed responses 
x and collateral variables. (See Thissen, Steinberg, and Wainer 
1987 on how this assumption can be tested in the context of IRT.) 
When (2.3) holds, the joint density of (x,0,y) is 



iu 



P(x,^y|^ 1 ^ 2 ,7) - f(x|*,0 L ) g(e\y,P 2 ) ?(y|7) 



(Note the slight change in the meaning of g; in a similar manner, 
h becomes h(x|y,0) when collateral variables are present.) As 
long as and 7 are distinct, the loglikelihood induced by 

observing y along with 6 and x is equal to (2.1) plus a constant 
insofar as /3 is concerned. Moreover, 




(2.4) 



where 



I 9XY 



E Y E x E e 



(s s 



|x.y) 



Var Yxe (sl) 



Since the off-diagonal blocks of I exy for elements of 0^ with 
those of both f)^ anc * 1 are zer °. it can De concluded that 
observing y provides no additional information about 0^ if both 6 
and x are also observed. 

3. Estimating /3 from x 
Of course it is never possible to observe 0. But, since $ i 
missing for all respondents regardless of the values of $ and x, 
it can be considered missing data that is "missing completely at 
random," and appropriate likelihood and sampling distribution 
inferences follow by marginalizing over $ in the complete -data 
likelihood (Little and Rubin 1987). It is common practice to 



estimate /3 from values of x alone- - ignoring y even if it is 
available—by maximizing the " incomplete -data" loglikelihood 

A x => S In h(x.|/3) - In [E e (exp A ex |x lf . . . ,x N ;^)] . 

(In the context of IRT , Bock and Aitkin 1981 refer to this 
procedure as "marginal maximum likelihood" estimation.) Again 
under regularity conditions, the MLE solves what is now an 

"incomplete-data" likelihood equation 0-3 A Provided 

A. 

differentials can be passed through the integral, 



d\ d 

S — In h(x |/3) 

dp i dp 



- S / — [f(x |*,/3 )g(*|/3 )] h ml tx.\fi) d8 
i dp £ 1 



- S / — (In f(x |*,/3 )g(5|/3 )] f(x.|fl) g(0) h'^x.) d8 
dp 1 



- S J s(^ , Xi ) p(8\x it p) 6.8 

- S E e (s|x.) . 



Accordingly, the incomplete-data information matrix is N I v , where 



I x - E x [E e (s|x) E fl (s'|x)] - Var Y [E Q (s|x)] 
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the final equality using E [E (six) ]-0. I is related to I 

A " A "A 

through a decomposition of variance- -an instance of Orchard and 
Woodbury's (1972) "missing information principle:" 

I ex - Var x [E e (s|x)] + E^Var^s'jx) ] 

*e|X' t ^ le m ^ ss ^ n S information, is the average variance of the 
complete -data gradient vector given x but not 0; that is, 
variation in s over possible values of 9 that could give rise to 
observed data x, averaged over x. If the variance of p(0|x,/3) 
were zero for all x- -loosely speaking, if x determined 9 with 
complete accuracy- - then Var^(s|x) - 0 for all x, and no 
information would be lost as a result of not observing 9. If 
values of 9 are not completely determined by x, this variance 
increases and information about f) is decreased. The proportional 
decrease, from diagonal elements of I to those of I , need not. 

"X A 

be the same for all elements of f). 



4. Estimating 0 from x and y 
When collateral variables y are available for respondents, 
the extended incomplete-data loglikelihood is 

- S In (h(x t |/3) p(y.|7)l 
i 

- In [E e (exp A 0x |x 1( y 1( ,..,x N ,y N ;/2)] + 2 In p(yj7) • 



As long as £ is distinct from 7, the shape of the likelihood 
surface with respect to ft involves only the first term- -the 
conditional distribution of the xs , given the observed values of 
the ys. In particular, the likelihood equation for fi is 
where 



XY 



- S J s(0,x.) p(«|x lt y lf iS) 
i 

- E E e (s|x l? y l ) , 



d$ 



with p(0|x,y,0) « f(xj^,/3 1 ) g(«|y.0 2 ) / h(x|y,/3) . The asymptotic 
distribution of the MLE under repeated samples of N (x,y) pairs 
does involve the distribution of y, however. The block of the 
information matrix for (£,7) that pertains to 0 is N I , where 

- E Y E x [E e (s|x,y) E e (s'|x,y)] - Var YX [E e (s|x,y>] . 
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Large sample ML inferences about /3 under repeated sampling of 
(x,y)can be based on this block alone, since the off -diagonal 
block pertaining to .the crossing of /3 and 7 is zero. (Section 5 
concerns repeated samples of N (x,y) values with the ys fixed at 
prespecified values.) 

We now focus on the effect of including y upon information 
about fi^ . Expressions for the 1^ block suffice, since the nullity 
of the off-diagonal block in I implies the nullity of the 
corresponding block in I (see Appendix). Using (2.4) and 

A. 

applying the missing information principle, 

8 



J4 



\ 



1 1 1 
ex xy + ejxY 



where 



As in (3.1), the missing information corresponds to a loss in the 
precision with which /J^ can be estimated. The loss is expressed 
as expected variation of s^" over possible values of the latent 
variable $, conditional now on values of y as well as those of x. 

Intuitively, less information should be lost if y is observed 
along with x. An expression for how much of the missing 
information has been recovered begins with another decomposition 
of the total variation in s^". Since 



r ex " r exY - Va W sl) 



it follows that 



Iq X - Var^E^Cs 1 )] + E x {Var y [ E Q (s l ) } ) + E^Var^s 1 ) 



T 1 + T 1 + T 1 
" X X + X Y|X + ^jXY 



is the variance in expected values of s 1 over x, averaging over 

y and 0. lL y is the expected variance of the average values of- 
Y|X 

s 1 with respect to 9 as y varies. It represents variation in 



E Q (s ) explained by y beyond that explained by x. I 9| X y is tne 
expected variation in s^" remaining unexplained after both x and y 
have been accounted for. 

The portion of missing information about that is recovered 
by using y, then, is 1^. - 1^ « I^j ^- -another application of the 
missing information principle, with (x,y) treated as the complete 
data and x as the incomplete data. When y and $ are independent, 
this term is zero because for each x, E n (s^") takes the same value 
at all values of y. No information about is lost by ignoring y 
in this case. When y and 9 are not independent, the degree to 
which information about f)^ increases depends not simply upon the 
strength of their relationship, but on the strength of their 
relationship conditional on x. There is less to be gained by 
using collateral information when 0 is already well determined by 
x alone . 

These results indicate that greater benefit accrues frcm 
using collateral information as it relates more strongly to the 
latent variable, and as less information is available from the 
observed responses x. Mislevy's (1987) analyses in the context of 
item response theory indicate that in typical applications of 
educational and psychological testing, readily available 
collateral variables such as educational and demographic data can 
often account for a third of the population variance, and increase 
the precision of /J^ roughly as much as two to six additional test 
items. This gain is substantial in applications such as 
educational assessment or attitudinal surveys, where a subject 
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might be administered only five or ten items; it is potentially 
useful in adaptive testing, where he might receive fifteen well- 
chosen items. The proportional gain is not impressive with 
individual achievement tests, where test lengths of 60 to 100 
items are common. 

5. "Conditional Expected" Information 
The preceding sections concern information matrices that 
require marginalization over the sample spaces of both x and y. 
They reflect the point of view one has before observing either x 
or y values. This section presents results for expected 
information conditional on given values of y. 

Expected information conditional on y is pertinent to the 
problem of experimental design, for example. If it is possible to 
stratify on y when gathering data, expected information for 
various combinations of y values can be compared to choose an 
optimal sampling scheme. This requires expectations over x 
conditional on fixed values of y. Let y - ( v i» ■ ■ • ^ e a vector 
of N specified values of y. Define the mixture density 

N 

p (9\P 2 ) - N S g«?|y.,/J 2 ) , 
y i 

which represents the marginal density of 9 in samples drawn in 
accordance with y. The complete -data expected information matrix 
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that corresponds to this density is defined analogously to I in 

"X. 

Section 3, as 
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I 9X(y) " H S(8 ' X) S ' (9 ' X) Py^l X '^) d9 h y( X l^) • 



where 



h (x\p) - J fCxI*,^) p ($\0 2 ) dO 



and 



p ce\x t p) - fCxj*,^) P ce\p 2 ) / h (x\p) 



*8X(y) ^ S t * ie ex P ectec * information about 0 corresponding to 
repeated observations of (x,0) sampled in accordance with y. 
When, as in practice, observations consist of (x,y) , one can 
calculate expected information when estimation ignores y values 
and when estimation uses y values. These are, respectively, 



I x(y) - J [Js p y (6\x,(3)d0) [JV Py (0|x,/?)d0] h y (x|0) dx 



- E X(y) [Var 9(y) (s l x) ] (5 - 2) 

and 

1 N 

I Xy - N 2 ;[E e (s|x,y.) E 9 (s'|x.y.)] h(x|y.,0) dx 
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- E y E x|y tVar e (s ' x ' y)1 • (5,3) 

By the missing information principle, the gain in information 

about expected when exploiting y for this particular value of 

y, or I* - I* N , is at least positive semidef inite . 
7 Xy X(y) 

6. Observed Information 
The expected matrices discussed in the preceding sections are 
functions of the true values of 0. In practice, they ar sometimes 

A 

approximated by substituting maximum likelihood estimates P for 0. 
The resulting "estimated expected information matrices" are 
consistent estimates of the desired values. They are to be 
distinguished, however, from "observed information" matrices. 

An observed information matrix is the negative inverse of the 
second derivative matrix of the loglikelihood, calculated with 

A 

maximizing value p f the observed responses (x^ x^) , and, if 

required, (y^ , . . . ,y^) . Observed information reflects the point of 
view after N sampled values of both x and y have been observed, 
and indicates the precision with which 0 has been estimated from 
the realized sample (Efron and Hinkley 1978). In a large-sample 
normal approximation to the posterior distribution of p under 
Bayesian inference, the posterior variance is the negative inverse 
of the observed information. 

Define the complete-data second derivative d(0,x) as 
ds(9 t x)/dp' . Using Louis' (1982) expressions for observed 
information in missing data problems, 
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N I 



- 2 (J(-d) p(0|x.,£) d* 



- JW p(tf|x.,£) dt? 



and 



r xy ~ S ^ ( ~ d) PC'l^.yfi) - Jss' p(0|x.,y.,£) d* ] 



In contrast to results with expected information, the off-diagonal 

blocks of I and I for the crossing of p. and 0 O need not be 
a. xy 1 /_ 

zero. Moreover, the appealing decomposition of expected 

information into variance components of s does not carry over to 

observed information; it depended on the fact that E(ss ' )--E(d) . 

For unfortuitous combinations of x and y values, I can exceed I 

x xy 

in some or even all diagonal elements. Inasmuch as observed 

information is quite generally a consistent estimate of expected 

information, however, the results of Section 4 suggest that one 

would expect to find the greater diagonal entries in I more 

xy 

often than not. 



7. A Numerical Illustration 
This section illustrates the ideas developed above in the 
context of an item response theory (IRT) model for mental test 
data. The values of I 0 X(y) » X x(y) ' and I Xy*" t:ne ^ uantities 
relevant to "conditional expected information- -are approximated 
here by evaluating (5.1) through (5.3) with 0 in place of 0. 
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7.1 The Data 

Observed responses x are vectors of responses to four items 
from the Arithmetic Reasoning test of the Armed Services 
Vocational Aptitude Battery. (ASVAB) , Form 8A, as observed in the 
sample of respondents from the survey Profile of American Youth 
(U.S. Department of Defense 1982) whose data are reported by 
Mislevy (1986). Response counts are shown in Table 1 for the 
N-776 respondents as a whole, and as broken down into the four 
categories of a demographic design with subsarcple sizes of 
263, 228, 140, and 145. A correct response is indicated by a 1 , 
an incorrect response by a 0. 



Table 1 about here 



7.2 The Model 

Using the numerical procedure described in Mislevy (1987), 
the 2-parameter logistic (2PL) IRT model was fit to the response 
counts in Table 1 by maximizing a loglikelihood of the form of 
(2.1). Under the usual IRT assumption of conditional 
independence, and with f}^ representing the item parameters 
(a^, b^, a^, b^) , we have 

4 

f(x\S 9 fi .) - II exp(1.7a x (0-b ) ] /( 1+exp [ 1 . 7a (0-b .)] ), 
or, equivalently , 
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X. 1-x. 

- n ? Ad) J q (9) J , 

j J J 

15 



where P. (0) - f . (x.-l I $ , a. ,b . ) and Q. (0) - 1-P. (0) . The 
J J J J J J J 

parameters a^ and b^ give the (linear) regression of the logit of 
Xj on 9 . Normal densities were assumed for gC^ly,^), ^ or v = 

1,...,4. The population parameters fi^ ~ (m^^^ ^4 ,a 4^ were 

constrained in order to make the model identified, by 

incorporating the computationally convenient constraints 
2 2 

2 My " 0 and 2(a^ + M^)/^ * 1. The resulting MLE's are shown in 
Table 2. 



Table 2 about here 



From this point on, the MLEs shown in Table 2 will be' treated 
as known true values for the purpose of approximating expected 
information matrices. Using the results of Section 5, expected 
information matrices will be calculated conditional on the 
observed subsample proportions p(y) - (.339, .294, .180, .187). 
The density p^(0) that obtains after fixing y in this manner is 
thus a mixture of four normal components: 



Pje\f3 0 ) - 2 g(B\u t a) p(y) 



y-1 



y y 



Values of y account for about 18 -percent of the variation of 9 in 
the mixture. 
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7 . 3 Formulae 

Let u. be an element of /L . Let W.(#) take the value 
J 1 J 

1.7(0-b.) if u. - a., and the value -1.7a. if u. - b . . The 
J J J J J J 

element of the complete-data gradient vector corresponding to u^ 
is 



s(u..;0,x) - [x.-P..(0)] Wj(^) 



Computing expected information requires the expected count of each 

response pattern x^ - (x^ x^). In subpopulation y, this 

value is 



N^ y - N y f fCxJ*,/^) g(*U y . V 



For the undifferentiated population (given the observed values of 

A A 

y) as a whoL , - 2 N^ y . These values are given in Table 3. 



Table 3 about here 



Expected information matrices are now obtained as follows: 



4(7) - = / s i s i' v'v d9 ' 



where - s 1 ^^) and p <*|xp - fCx^*,/^) p <*|0 2 ) / h (x|/3) 
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2, 



and 

ij - SS N [/ sj p(^|x r y) dtf][ / sj' p(»|x y) dd ] , 
y Jty y 

where p^Jx^y) - fCxJ*,/^) p(*|0 2> y) / Mx^y,/?). 
7.4 Results 

Tables 4. 5, and 6 present x , 1^, s , and l} r . These 

r ©X(y) X(y) Xy 

matrices are block-diagonal, with only off -diagonal elements for 
the a and b parameters of a given item taking possibly non-zero 
values. The proportions of effective information and partial 
recovery for the diagonal elements are summarized in Table 7. 
Compared to the information expected if (x,0) were observed, the 
degree of information expected when only x is observed averages 
36 -percent for a parameters and 85-percent for b parameters. 
Using (x,y) yields corresponding values of 40-percent and 
87 -percent. Averaging over item- level results, the degree to 
which missing information is recoverable (for the observed values 
of y) is 7-percent for a's and 17-percent for b's. 



Tables 4-7 about here 
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Appendix 

Section 4 uses the large-sample result that if the off- 
diagonal block in I corresponding to the crossing of 0^ and fi 0 

"X 1 2. 

is null, so is the corresponding block in I . This appendix gives 

A. 

a heuristic argument in support of that claim. Since the observed 

information matri^ I introduced in Section 6 is a consistent 

estimate of I , it suffices to show that the expectation of the 
x 

off -diagonal block in I is null. 
From Section 6, 

N 

N I x - S [/(-d) p(*|x.,/3) d6 - Jss' p(tf|x.,/?) d$] . 
i 

A 

Substituting 0 for 0, and taking expectation over x gives 

E x I x - E x [E e (-d)] - E x [E e (ss')] . (A.l) 

It is clear from (2.2) that the off -diagonal block of d is null 
for all x and all 0, so the corresponding block of the first term 
on the right of (A.l) is null. The second term is Var (s) , or 

"a 

I , which, from Section 2, also has a null off- diagonal block. 
"X 

The off-diagonal block of the matrix difference between the two 
terms must be null too. 
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Table 1 

Observed Counts of Response Patterns 



Response y 



1 2 


3 


4 


1 


2 


3 


4 


Total 


0 0 


0 


0 


23 


20 


27 


29 


99 


0 0 


0 


1 


5 


8 


5 


8 


26 


0 0 


1 


0 


12 


14 


15 


7 


48 


0 0 


1 


1 


2 


2 


3 


3 


10 


0 1 


0 


0 


16 


20 


16 


14 


66 


0 1 


0 


1 


3 


5 


5 


5 


18 


0 1 


1 


0 


6 


11 


. 4 


6 


27 


0 1 


1 


1 


1 


7 


3 


0 


11 


1 0 


0 


0 


22 


23 


15 


14 


74 


1 0 


0 


1 


6 


8 


10 


10 


34 


1 0 


1 


0 


7 


9 


8 


11 


35 


1 0 


1 


1 


19 


6 


1 


2 


28 


1 1 


0 


0 


21 


18 


7 


19 


65 


1 1 


0 


1 


11 


15 


9 


5 


40 


1 1 


1 


0 


23 


20 


10 


8 


61 


1 1 


1 


1 


86 


42 


2 


4 


134 


Total 






263 


228 


140 


145 


776 



Table 2 





Maximum 


Likelihood 


Estimates 


of P 




Item 


a . 


b. 


y- value 




a 




J 


J 






y 


1 


1.006 


.421 


1 


.485 


1.164 


2 


.672 


.213 


2 


.073 


.855 


3 


. 775 


.139 


3 


- .513 


.642 


4 


.834 


.402 


4 


- . 502 


.640 



28 

22 



Table 3. 

Expected Counts of Response Patterns 



Response y 
1234 1 2 3 4 Total 



0 0 


0 


0 


27 


17 


27. 


85 


31 


70 


32. 


34 


119 


06 


0 0 


0 


1 


4 


88 


5. 


81 


5 


37 


5. 


54 


21 


60 


0 0 


1 


0 


7 


55 


8. 


97 


8 


46 


8 


73 


33 


71 


0 0 


1 


1 


3 


16 


3 


56 


2 


31 


2 


41 


11 


44 


0 1 


0 


0 


12 


76 


15 


04 


14 


69 


15 


13 


57 


62 


0 1 


0 


1 


4 


80 


5 


49 


3 


77 


3 


92 


17 


98 


0 1 


1 


0 


7 


07 


8 


15 


5 


77 


6 


00 


26 


99 


0 1 


1 


1 


6 


00 


5 


55 


2 


35 


2 


46 


16 


36 


1 0 


0 


0 


15 


72 


18 


75 


16 


23 


16 


79 


67 


48 


1 0 


0 


1 


8 


38 


8 


95 


5 


08 


5 


31 


27 


72 


1 0 


1 


0 


12 


.04 


13 


04 


7 


68 


8 


01 


40 


77 


1 0 


1 


1 


14 


.77 


11 


66 


3 


.80 


3 


99 


34 


.22 


1 1 


0 


0 


17 


.86 


19 


.80 


12 


.37 


12 


.90 


62 


.93 


1 1 


0 


1 


19 


.51 


16 


.26 


5 


.76 


6 


.05 


47 


.58 


1 1 


1 


0 


26 


.58 


22 


.79 


8 


.46 


8 


.88 


66 


.71 


1 1 


1 


1 


74 


.75 


36 


.33 


6 


.20 


6 


.55 


123 


.83 


Total 






263 


.00 


228 


.00 


140 


.00 


145 


.00 


776 


.00 



Table 4 

Expected Complete-Data Information 





a l 


b l 


a 2 


b 2 


a 3 


b 3 


a 4 




a 


176.75 




290.81 




254.24 




249.38 




b 


-68.60 


371.64 


-37.80 


201.01 


31.42 


250.70 


75,89 


267.98 



Note: Only diagonal blocks are shown; all other entries are zero. 
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Table 5 

Expected Incomplete-Data Information, Ignoring y 





a i 


b l a 2 


b 2 


a 3 b 3 


a 4 b 4 


a 


51.76 


114.82 




90.72 


95.65 


b 


■55.09 


298.63 -31.91 


179.77 


31.70 215.75 


70.80 224.58 


Note 


Only diagonal blocks 


are shown 


all other entries are zero. 



Table 6 

Expected Incomplete-Data Information, Using y 




a 53.39 119.79 105.18 116.72 

b -46.78 308.91 -22.49 183.28 44.33 222.03 85.86 232.98 



Note: Only diagonal blocks are shown; all other entries are zero. 
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Table 7 

Recovery of Missing Information (Diagonal Elements) 





a l 


a 2 


a 3 


a 4 


average 


I X(y) /I 9X(y) 


.29 


.39 


.36 


.38 


.36 


I Xy /I 6X(y) 


. 30 


.41 


.41 


.47 


.40 


T T 

Xy " ^(y) 


.01 


.03 


.09 


.14 


.07 




r 6X(y) " X X(y) 














b l 


b 2 


b 3 


b 4 


average 


I X(y) /I 8X(y) 


.80 


.89 


.86 


.84 


.85 


I Xy /I 9X(y) 


.83 


.91 


.89 


.87 


.87 


X Xy " X X(y) 


.14 


.17 


.18 


.19 


.17 




r eX(y) " X X(y) 
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